Extract Reliable Relations from Wikipedia Texts for Practical Ontology Construction

نویسندگان

Jin-Xia Huang

Kyung-Soon Lee

Key-Sun Choi

Young Kil Kim

چکیده

A feature based relation classification approach is presented in this paper. We aimed to exact relation candidates from Wikipedia texts. A probabilistic and a semantic relatedness features are employed with other linguistic information for the purpose. The experiments show that, relation classification using the proposed relatedness features with surface information like word and part-of-speech tags is competitive with or even outperforms the one of using deep syntactic information. Meanwhile, an approach is proposed to distinguish reliable relation candidates from others, so that these reliable results can be accepted for knowledge building without human verification. The experiments show that, with the relation classification approach presented in this paper, more than 40% of the classification results are reliable, which means, at least 40% of the human and time costs can be saved in practice.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

11th International Protégé Conference 2009

The focus of this research is the automatic extraction of an ontology of persons in Information Technology. Our approach involves the extraction of a categorization hierarchy of Wikipedia, the extraction of information about persons and the extraction of relations between persons. We have investigated the suitability of Wikipedia to extract social relations. Our research indicates that the info...

متن کامل

Leveraging Wikipedia Characteristics for Search and Candidate Generation in Question Answering

Most existing Question Answering (QA) systems adopt a type-and-generate approach to candidate generation that relies on a pre-defined domain ontology. This paper describes a type independent search and candidate generation paradigm for QA that leverages Wikipedia characteristics. This approach is particularly useful for adapting QA systems to domains where reliable answer type identification an...

متن کامل

Automatic Construction of Ontology from Arabic Texts

The work which will be presented in this paper is related to the building of an ontology of domain for the Arabic linguistics. We propose an approach of automatic construction that is using statistical techniques to extract elements of ontology from Arabic texts. Among these techniques we use two; the first is the “repeated segment” to identify the relevant terms that denote the concepts associ...

متن کامل

Automatic Topic Ontology Construction Using Semantic Relations from WordNet and Wikipedia

Due to the explosive growth of web technology, a huge amount of information is available as web resources over the Internet. Therefore, in order to access the relevant content from the web resources effectively, considerable attention is paid on the semantic web for efficient knowledge sharing and interoperability. Topic ontology is a hierarchy of a set of topics that are interconnected using s...

متن کامل

Automatic Topic Ontology Construction Using Semantic Relations from WordNet and Wikipedia

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

Computación y Sistemas

دوره 20 شماره

صفحات -

تاریخ انتشار 2016

Extract Reliable Relations from Wikipedia Texts for Practical Ontology Construction

نویسندگان

چکیده

منابع مشابه

11th International Protégé Conference 2009

Leveraging Wikipedia Characteristics for Search and Candidate Generation in Question Answering

Automatic Construction of Ontology from Arabic Texts

Automatic Topic Ontology Construction Using Semantic Relations from WordNet and Wikipedia

Automatic Topic Ontology Construction Using Semantic Relations from WordNet and Wikipedia

عنوان ژورنال:

اشتراک گذاری